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On  functional  estimates  for  ill-posed  linear  problems 

R.  Brigola*^  and  A.  Keller 

Abatract;  Ill-posed  linear  problems  in  Hilbert  space  are 
considered  as  stochastic  filtering  problems.  Functional 
estimates  of  the  signal  x  are  given  for  the  problem 
Ax  -  y  =  z  where  A  is  a  linear,  not  necessarily  bounded 
operator  between  Hilbert  spaces  and  x,  y,  z  are  Hilbert 
space  valued  random  elements  .  As  an  application, 
functional  estimates  are  given  explicitly  for  Radon  trans¬ 
formed  signals  with  additive  white  noise. 

1,  Introduction:  Let  Hj  and  H2  be  Hilbert  spaces  and  A:Hj  - >  H,  a  linear 

operator.  By  Hadamard’s  definition,  a  linear  problem  Ax  =  z  is  well-posed  if 
a  solution  exists,  is  unique  and  depends  continuously  on  the  data  z « H2_ 
Otherwise  it  is  called  ill-posed. 

Deterministic  regularization  methods  for  ill-posed  problems  have  been 
extensively  treated  in  literature,  starting  from  the  work  of  A.  N.  Tichonov 
and  V.  Ya.  .Arsenin  [12].  Further  references  may  be  found  in  [6]  or  [7],  for 
instance.  Since  an  equation  Ax  =  z  often  decribes  a  functional  relationship 
between  an  unknown  state  x  and  an  observation  z,  which  may  be  affected 
with  random  additive  noise,  iil-posed  problems  have  also  been  considered  as 
stochastic  filtering  problems  Ax  +  y  =  z ,  where  x,  y,  z  are  Hilbert  space 
valued  random  elements.  For  bounded  linear  transformations  A,  defined 
everywhere  in  Hj,  stochastic  solutions  for  these  equations,  depending  on 
various  models  for  the  noise  v,  have  been  studied  in  the  work  of  J.  N. 
Franklin  [2],  V.  Friedrich  and  A.  Uhlig  [3], 

In  this  work,  we  will  consider  linear  transformations  A  which  are  not  necessa¬ 
rily  bounded  or  defined  everywhere  in  H  ,  and  obtain  estimates  for  a  given 
class  of  linear  functionals  of  the  signal  x,  given  an  observation  z  =  Ax  +  y, 
where  v  is  a  noise  on  H?  with  positive  definite  covariance.  Signal  and  noise 
will  be  assumed  to  be  zero-mean,  Gaussian,  weak  random  variables  on  Hj 
resp.  H2. 

►)  The  first  author's  reseaj-ch  has  been  supported  by  AFOSR  Contract 
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2.  Dj- best,  linear  estimates  in  an  operator  class  A 


Let  Hs  and  H2  be  Hilbert  spaces  and  A:  DA - *  H2  be  a  linear  operator  with 

domain  of  definition  D  c  H  ,  and  A':  D  - >H  be  its  formally  adjoint 

operator,  i.  e.  <Ax,y>  =  <x,A'y>  for  x  «  DA  and  y  «  DA>  (  cf.  [13]  ). 

Here  <.,.>  denotes  the  inner  product  in  Hj  resp.  H2- 

Furthermore,  let  the  signal  x  be  a  zero-mean,  Gaussian,  Hj-valued  weak 
random  variable  ( cf.  [l])  with  covariance  operator  B,  and  the  noise  y  be  a 
2ero-mean,  Gaussian,  H2*valued  weak  random  variable  with  positive  definite 
covariance  operator  C.  Signal  and  noise  are  assumed  to  be  independent,  i.  e. 
to  have  independent  finite  dimensional  distributions. 

Given  an  observation  z  =  Ax-v,  we  look  for  a  functional  h2  «  H2  of  the  obser¬ 
vation  z  to  estimate  a  given  functional  <x,g>,  gtH^,  of  the  signal.  More 
generally,  given  a  subspace  Dj  of  H  ,  we  look  for  an  estimation  operator  L: 

Hj  - H.,  transforming  the  given  functionals  in  D  into  functionals  in  H„, 

such  that  <z,Lg>  is  a  least  squares  estimate  for  <x,g>  simultaneously  for  all 
g«D3-  Therefore  we  define: 

Definition  (2.1): 

Let  i  c  ]L:  H]  — *  H2  ]  L  linear  and  Dj  c  D  }  be  a  given  class  of  admissible 
estimation  operators. 

1)  For  g«D  ,  the  risk  rg(L)  of  L<A  is  given  by 

rg(L)  :=  E  ( i<z.Lg>  -  <x.g>!2  )  . 

2)  LQ  «  A  is  called  Di  -best,  linear  estimator  for  x  in  A  if  simultaneously  for 
all  g  (  Dj  and  L  «  A  it  holds: 

VLo>  5  rg(L*  ‘ 

It  will  become  clear  from  the  following,  that  certain  restrictions  have  to  be 
imposed  on  the  class  of  admissible  estimation  operators  to  obtain  well-de¬ 
fined  domains  of  definition  for  the  estimations,  since  we  deal  with  un¬ 
bounded  operators,  generally  not  defined  everywhere  in  the  Hilbert  spaces 
H,  or  H2. 

Now,  let  T  ••=  BA'tABA'  -  C)  1  on  its  natural  domain  of  definition.  If  we 
denote  the  covariance  operator  of  z  by  K;=AB.V  +  C,  then  K  is  one-to-one, 
since  C  is  positive  definite  and  B  is  positive  semjdefinite.  Therefore  T  is 
well-defined  and  for  its  domain  of  definition  Dr  it  holds: 

Dr  =  rg(K)  =  K(DABA.  ). 

since  DK  -  DaBa.  «  DBA.  .  Also,  P  =  K*'AB  =  (AB.V  -  CT'aB  is  well-de- 
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fined  on  its  natural  domain.  Here  rg(K)  denotes  the  range  of  K. 

Lemma  (2.2): 

Let  rg(A’)  c  c  Dr. .  Then  the  following  statements  hold: 

D«  c  Dab 

2)  B(  Dj )  c  Da 

3)  ABiDj)  c  rg(K) 

4)  D  =  D 

K  A- 

5)  T  is  formally  adjoint  to  T’lD 
Proof: 

The  statements  1)  -  3)  follow  immediately  from  the  assumption  Dj  c  Dp  . 

Statement  4)  follows  from  1)  and  the  assumption  rg(A‘)  c  D  ,  and  5)  from 
the  definition  of  T  and  P. 

Theorem  (2.3): 

Let  P  :=  (ABA,  +  C)  1  AB  and  rg(A')  c  Dj  c  Dp  . 

Then  P  is  the  Dj-best,  linear  estimator  for  x  in 


A  :=  {  L:  D, 


D  j  L  linear  and  rg(ABA'  +  C)  c  DL.  } . 


Proof: 


First,  we  state  that  P  «  A: 

P  is  defined  on  D}  and  by  Lemma  (2.2),  3)  and  4)  we  obtain; 

P(Dj)  =  K  1  ABtDj)  c  Dk  =  Da<  .  The  formally  adjoint  T  of  P  is  defined  on 
rg(K),  thus  P  (  A. 

The  risk  of  an  arbitrary  L«  A  for  fixed  g  t  Dt  is  given  by 
rg(L)  =  E( !  <x,g>  -  <z,Lg>  i2)  =  E( !  <x,  g-  A'Lg  >  I2) 

-  E(  <x,  g- A'Lg>  <y,Lg!>)  -  E(  <x,  g- A'Lg>  <y,Lg>)  +  E(i<y,Lg>  i2) . 
By  the  assumption  that  x  and  y  are  zero-mean  and  independent,  we  have 
rg(L)  =  <Bg.g>  *  <BA'Lg,A’Lg>  -  <BA'Lg,g>  -  <Bg,A'Lg>  ♦  <CLg,Lg>  . 

Since  by  Lemma  (2.2),  rg(A')  c  D,  c  D4_  and  AB(D  )  c  rg(K),  and  since 
rg(K)cD  .  we  obtain: 

rg(L)  =  <Bg,g>  ♦  <ABA'Lg.Lg>  *  <CLg,Lg>  -  <BA'Lg,g>  -  <L'ABg,g> 

=  <Bg.g>  *  <L'KLg,g>  -  <BA'Lg,g>  -  <L'ABg,g>  . 

Thus,  comparing  the  risk  of  L  with  the  risk  of  T,  it  follows: 

rg(L)  -  rg(P)  =  <L’Klg,g>  -  <nCPg,g>  -  <BA'Lg,g>  -  <L‘ABg,g> 

-  <BAT'g,g>  +  <TABg,g>. 

Using  FK  =  BA'  on  D  =  D  and  KP  =  AB  on  D  ,  we  have: 
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rg(L)  -  rg(r-)  =  <L'KLg,g>  -  <L-Krg,g>  -  <nag,g>  ♦  <ntrg.g> 

.  =  <(L,-r)K(L,-rvg,g>  ; 

eventually  we  have  rgd")  *  rg(L),  since  K  is  positive  definite. 

Remark  (2.4): 

The  operator  class  A,  which  we  have  used,  is  maximal  in  the  following 
sense: 

1)  For  an  element  L  <  A,  the  condition  c  DL  is  necessary  to  define  <z,Lg>, 
and  L(D  )  c  D  is  necessary  for  <x,A'Lg>  to  exist  for  all  g«D  . 

1  A  J 

2)  For  L  «  A,  the  existence  of  a  formally  adjoint  L‘  with  DL.  =  rg(K)  is  used 
to  ensure  comparability  of  the  elements  of  A  with  respect  to  the  risks  rg, 
g‘D,- 

3.  Application:  Functional  estimates  for  noisy  Radon  transformed  signals 
A  well-known  ill-posed  linear  problem  is  Radons  integral  equation  ( cf.  [10]). 
The  Radon  transform  has  technically  been  used  in  Computational  Axial 
Tomography  for  the  reconstruction  of  a  density  function  from  its  integrals 
along  hyperplanes.  This  application  motivates,  in  spite  of  known  inversion 
formulas,  a  stochastic  treatment,  because  one  has  only  finitely  many  data, 
which  additionally  may  be  affected  with  measurement  errors,  and  the  un¬ 
known  density  is  in  time  randomly  dependent  on  body  functions  of  the 
patient,  for  instance  slight  motions  during  measuring. 

We  will  use  the  following  notations. 

Definition  (31): 

Let  L'(Rn)  be  the  space  of  Lesbegue-integrable  functions  on  Rn,  n*2, 
f«  L1  (Rn),  Sn*‘  the  unit  sphere  in  Rn  and  H(p,q)  ==  {  x«  Rn  :  <x.q>  =  p}, 
(p.q)  e  R*Sn  l,  a  hyperplane  in  Rn. 

Then  the  Radon  transform  Rf  of  f  is  defined  by 

Rf(p,q)  :=  J  f(x)  m(dx) , 

H(p,q) 

where  dm  is  the  (n-l)-dimensional  Lesbegue  measure  on  H(p.q). 


According  to  [11],  the  Radon  transform  is  not  a  continuous  operator  on  the 
whole  of  L2(Rn),  the  Hilbert  space  of  square  integrable  functions. 

To  use  the  above  concept  of  functional  estimation  for  the  problem  Rx  ♦  y  =  z, 
we  will  make  the  following  assumptions: 

i)  Let  Hj  :=  L2(Rn)  and  H2  ==  L2  (S  x  Sn_1),  endowed  with  the  usual  inner 


products,  n>2  a  fixed  integer. 

ii)  If  f  (Rn)  denotes  the  Schwartz  space  of  rapidly  decreasing  functions  on 
Rn,  we  identify  f(Rn)  with  a  subspace  of  L2(Rn),  and  consider  the 
Radon  transform  R  as  operator  from  L2(Rn)  into  L2(RxSn~')  with 
Dr  =  ?(Rn),  (cf.  [8]). 

iii)  The  observation  z  is  assumed  to  be  z  =  Rx*y,  where  the  signal  x  is 
assumed  to  be  a  zero-mean,  Gaussian,  Hj-valued  random  element,  which 
is  stationary,  i.  e.  its  covariance  operator  B  is  given  by: 


<g,Bh> 


-n 

Rn  Rn 


b(i!q  -  sli)  g(q)  h(s)  dqds  ,  g,h<  f(R  ), 


for  some  b  «  f  (R),  ( cf.  [5  ] ). 

iv)  The  noise  y  is  assumed  to  be  zero-mean,  Gaussian  white  noise  on 
L2(R*Sn~'),  i-  e.  a  weak  random  variable  with  covariance  operator  o2I, 
with  I  the  identity  operator  on  L^RxS”'1). 

v)  The  signal  and  the  noise  are  assumed  to  be  independent. 

vi)  Let  the  class  Dj  of  functionals,  estimates  are  asked  for,  be  given  by 
?(Rn). 

We  will  need  the  following  definitions  and  relations  between  Radon,  Fourier 
and  Hilbert  transforms. 

Definition  (3.2): 


1)  Let  c  - 


2  (2r ) 


2)  The  multiplication  operators  Mk:  f{R>Sn'’) 


L2(R*Sn_I)  resp. 


M  •  ?(Rn) 

k 


L2(Rn)  are  defined  for  ki  1  by: 


Mkh(p,q)  ==  ,’pr'1  h(p,q) ,  (p.q)  «  RxSn_1  ,  hi  f(RxSn_1) 


Mkg(x) 


i)  i:xlik_1 


g(x),  x<  Rn,  g«  f(Rn) 


3)  <D(x)  ==  J bdsl!)  e"1<*-’c>  ds  .  x.Rn, 


i.  e.  <*>  is  the  power  spectral  density  of  the  signal  x  (cf.  [5]). 

Since  <J>*f(Rn)  and  depends  only  on  the  norm  of  its  argument, 
<p(r) '■=  <t>(i'xl ),  r  =  i; xl: ,  x  <  Rn,  is  well-defined.  Of  course,  <t>  and  are  non¬ 
negative. 


1. 


mmmmm 
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lemma  (3.6): 

For  the  covariance  operator  B  of  the  signal  x,  it  holds: 

B  =  7~l  M<j,  7  on  f  (Rn) 

Proof: 

For  given  g  «  f(Rn),  h«  L2(Rn)  we  have: 

<Bg,h>  =  f  J  bills  -  ql!)  g(q)  his)  dq  ds 
Rn  Rn 

=  (2 x)"n  J  J  b( I! s  -  q  I!)  JVg(u)  ei<u,q>  du  J IW)  e'i<v,5>  dv  dqds 

Rn  Rn  Rn  Rn 


=  (2~)~n  J  (  J(  j* 7 g(u)  4>(u)  ei<u’VlS>  du}ds)?h(v)  dv  , 

Rn  Rn  Rn 

where  the  last  equality  holds  by  Fubini's  theorem,  the  well-known  substi¬ 
tution  rule  for  integrals,  and  by  definition  of  <J>. 

Thus,  by  Lemma  (3.3),  2)  we  obtain: 

<Bg,h>  =  |  Fg(v)  $(v)fh(7)  dv  =  <M$,Tg  ,?h>  , 

R" 

which  proves  the  assertion. 

Now,  we  can  calculate  best  functional  estimates  for  functionals  in  ?(Rn), 
according  to  theorem  (2.3),  given  the  problem  Rx  -  y  =  z  under  the  above 
assumptions: 

Theorem  (3.7): 

1)  The  operator  P:  D^. - >  L2(R*Sn  1 ) ,  define!  by  P  :=  (RBR‘  +  c2I)  1  RB, 

gives  a  ?(Rn)-best,  linear  estimator  for  x  in  the  sense  of  definition  (2.1) 
within  the  class: 

A  :=  {  L:  f(Rn)  - »rg(VR)  |  L  linear  and  rg(RBR'  +  c2l )  c  DL.  }. 

Here,  V  is  the  operator  from  definition  (3.2). 

2)  For  (p.q) «  R *  SnM  and  g  «  f  (Rn),  it  holds: 

rg(p.q)  =  c J  £  Ts(r<1,'‘rP  ^ 

R 

Proof: 

Clearly,  s2I  (o  >  0)  is  positive  definite,  and  by  Lemma  (3.4): 
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Thus,  we  have  to  show  ?(Rn)  =  Dj  c  Dr  ,  according  to  theorem  (2.3).  Let  us 
state,  that: 

RBR'  *  o2l  =  RB(VR)"1  *  o2VR(VR)-1  =  R(B  +  co2T_1M  7)  (VR)_1 

n 

=  R?"1  (M  .  +  cc2  M  )7(VR)_1 

<S>  n 

by  Lemma  (3.4),  (3.5)  and  (3.6). 

Therefore,  we  have: 

i)  (RBR'*  o2I)VR(f(Rn))  =  RJ'1  (M  *  cc2  ft  )7(?(Rn)) 

C>  n 

Since  4>  t  f(Rn),  also  the  functions  hg  on  Rn,  defined  by: 


hg(x) 


4>(x) 


2ii  ,iin-l  S^x)  ,  g  *  f(R  )  , 


4>(x)  ♦  co  l!x 

are  elements  of  f(Rn). 

Hence,  we  obtain:  _ 

ii)  M-RlffR"))  c  (M  *cc2M  )?(f(Rn)). 

<X>  O  n 

Therefore,  application  of  R?  1  on  both  sides,  Lemma  (3.6)  and  i)  yield: 
RB(?(Rn))  c  RT1  (M-  *  cc2  ft  )7(f(Rn))  =  (RBR*  +  o2I)\lR(f(Rn)). 

n 

Thus,  we  have  shown,  that  RB(f(Rn))  c  rg(RBR‘  +  c2I),  which  implicates: 
t (Rn)  c  . 


Since  D  =  VR(f(Rn))  =  rg(YR),  the  class  d  equals  that  one  used  in  theorem 
(2.3),  from  which  we  now  obtain  assertion  1). 

To  prove  assertion  2),  remark  that  by  the  above  we  have  for  gt  f(Rn): 

T'g  =  (RBR'-  o2I)'JRBg  =  CRT-1  (M^  *  co2  ftn)7  (MR)*1  ]_1  RBg 
=  VRJ^M  *C32M  )_1TR-,RBg  . 

v  n 

Using  Lemma  (3.5)  and  (3.6),  we  get: 

T’g  =  cRT1  M  (M  *c32ft  f1  M  7 g  . 

Eventually,  the  representation  of  the  Radon  transform  in  Lemma  (3.3)  implicates 
for  (p,q)*R*Sn'\  g<  ?(Rn): 


rg(p.q)  =  (2r)5-  f  — £JIraJ*2Il < 

J  $>(rq)  ’  cc2  Lrqii 
R 


<b(rq) 


FT  ":n-i  fg^qle1^  dr. 


rgip.d  =  aj-'  J  Tg(rQ)e^ 


Remark  (3.8): 

1)  Thus,  the  theorem  says,  that  the  evaluation  J Pg(p.q)  z(p,q)  dpdq  of  the 
observation  z=Rx-y  gives  the  best  estimate  for  the  weighted,  total  den¬ 
sity  Jg(s)  x(s)ds:  the  local  density  x(s0),  sQ  <  Rn  can  be  approximated 


using  the  approximate  identity  as  weight  functional  g<f((Rn).  With  regard 
to  the  representation  of  I”  in  the  proof  of  theorem  (3.7),  the  estimation 
error  is  given  by: 


E(i<z,rg>  -  <x.g>|‘) 


=  J  4>(x)  (l  - 


- - ^2^ - £1  )*g(x)7g(x)  dx 

2(27r)n'‘  4>(x)-o2!;xl|n"' 


2)  If  the  Radon  transform  is  considered  as  a  mapping  between  weighted 
L2-spaces,  deterministic  regularizations  have  been  worked  out  by  A.  K. 
Louis  [7].  Using  the  above  stochastic  filtering  method,  for  this  case 
solutions  can  also  be  obtained  and  be  related  to  the  result  of  Tichonov 
regularization.  This  will  be  done  elsewhere. 

Acknowledgement: 

The  authors  want  to  express  their  gratitude  to  Professor  D.  Kolzow,  Univer¬ 
sity  of  Erlangen,  for  the  suggestion  of  the  subject  and  helpful  discussions. 
The  first  author  also  thanks  the  hospitality  and  support  of  Professor  G. 
Kallianpur,  University  of  North  Carolina  at  Chapel  Hill,  where  part  of  the 
work  has  been  done. 

References: 


[1  ]  Ba'.akrishnan.  A.  V.:  Applied  Functional  Analysis;  Springer,  New  York-Hei- 
delberg-Berlin,  1976 

[2]  Franklin.  J.  N.:  Well-Posed  Stochastic  Extensions  of  Ill-Posed  Linear 
Problems;  J.  Math.  Anal.  Appl.  31,  1970,  682-716 

[3]  Friedrich.  V.:  Uhlig.  A.:  Zur  stochastischen  Regularisierung  linearer 
Gleichungen  in  Hilbertraumen;  Beitr.  Num.  Math.  7,  1979,  33-48 

[4]  Gelfand.  I.  M.:  Graev.  M.  1.:  Vilenkin.  N.  Ya.:  Generalized  Functions,  Vol.  V, 
Academic  Press,  New  York,  1964 

[5]  Gelfand.  I.  M.:  Vilenkin.  N.  Ya. :  Generalized  Functions,  Vol.  IV;  Academic 
Press,  New  York,  1964 

[6]  Groetsch.  C.  W.:  The  theory  of  Tikhonov  regularization  for  Fredholm 
equations  or  the  first  kind;  Pitman,  Boston,  1984 

[7]  Louis.  A.  K.:  Tikhonov-Phillips  Regularization  of  the  Radon  Transform; 
Int.  Ser.  Num.  Math.,  Vol.  73,  211-223 


[8]  Ludwig,  D.:  The  Radon  Transform  on  Euclidian  Space;  Comm.  Pure  Appl. 
Math.  19,  1966,  49  -  81 

[9]  Natterer.  F.:  The  Mathematics  of  Computerized  Tomography;  Teubner, 
Stuttgart.  1986 

F lOl  Radon.  J.:  liber  die  Bestimmung  von  Funktionen  durch  ihre  Integral werte 
langs  gewisser  Mannigfaltigkeiten;  Ber.  Verh.  Sachs.  Akad.  Wiss.  Leipzig. 
Math.  Natur.  Kl.  69,  1917,  262-277 

[ll]  Smith.  K.  T.;  Solmon,  D.  C.:  Lower  Dimensional  Integrability  of  L2- 
Functions;  J.  Math.  Anal.  Appl.  51,  1975,  539-549 
fl2l  Tichonov.  A.  N.;  Arsenin.  V.,  Ya.:  Methods  of  Solving  Ill-Posed  Problems; 
Nauka,  Moscow,  1974 

[13]  Weidmann.  J.:  Lineare  Operatoren  in  Hilbertraumen;  Teubner,  Stuttgart, 
1976 


-J  — 

•  o 

oc  e»  u 


c  c 

•  I 

Is  -a 


0  m  a 

9  0  O. 

cr  n  w 

*1  ^ 


1  a  ~ 


!  I 

si  " 

i  \ 


l.  3 

u  M 

si  s 

si  2 


H  I 


u.  5  o  • 
M  £  N  S 


S  *■£ 
3  tS 

>s  U 

M  h 

h 

•  t*  •  w 

?r  v? 

If  || 

Si  3? 


>  s 
5  s 
SS  3 

*J  lm 

Sf  3 

21  3 

0  « 

«M  0  | 

o  U  o 
§1  o 

-2  43  >, 

w  u  u 

3  J  > 


*?  a  u 

0  c  0 

u  w  K 

JSJ*  * 


^  8  i 


i  H  n  s 


3  S 


*  i  3 


?  ?8 
5  H 

*-  H  •< 


s  : 

1  Is 

•  i  . 


a  a  c 

T  <*  “ 


u  r  o 
0  0  •*« 
ft.  — «  « 

>!  I 

h  I 

H  S 

3  * 

ti  h 


?§*®*2aaj**sss§§§§agg§ 


RH&I 


>.• 

B 

«;S 

m 


a 


i&'W 

,%'.N' v\v 

•w^ 


